A New Distribution Family for Microarray Data †
نویسندگان
چکیده
The traditional approach with microarray data has been to apply transformations that approximately normalize them, with the drawback of losing the original scale. The alternative stand point taken here is to search for models that fit the data, characterized by the presence of negative values, preserving their scale; one advantage of this strategy is that it facilitates a direct interpretation of the results. A new family of distributions named gpower-normal indexed by p∈R is introduced and it is proven that these variables become normal or truncated normal when a suitable gpower transformation is applied. Expressions are given for moments and quantiles, in terms of the truncated normal density. This new family can be used to model asymmetric data that include non-positive values, as required for microarray analysis. Moreover, it has been proven that the gpower-normal family is a special case of pseudo-dispersion models, inheriting all the good properties of these models, such as asymptotic normality for small variances. A combined maximum likelihood method is proposed to estimate the model parameters, and it is applied to microarray and contamination data. Rcodes are available from the authors upon request.
منابع مشابه
Integration and Reduction of Microarray Gene Expressions Using an Information Theory Approach
The DNA microarray is an important technique that allows researchers to analyze many gene expression data in parallel. Although the data can be more significant if they come out of separate experiments, one of the most challenging phases in the microarray context is the integration of separate expression level datasets that have gathered through different techniques. In this paper, we prese...
متن کاملOn a New Bimodal Normal Family
The unimodal distributions are frequently used in the theorical statistical studies. But in applied statistics, there are many situations in which the unimodal distributions can not be fitted to the data. For example, the distribution of the data outside the control zone in quality control or outlier observations in linear models and time series may require to be a bimodal. These situations, oc...
متن کاملA New Method for Generating Continuous Bivariate Distribution Families
Recently, it has been observed that a new method for generating continuous distributions, T - X family, can be quite effectively used to analyze the data in one dimension. The aim of this study is to generalize this method to two dimensional space so that the marginals would have T - X distributions. So, several examples and properties of this family have been presented. As ...
متن کاملModification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملLossless Microarray Image Compression by Hardware Array Compactor
Microarray technology is a new and powerful tool for concurrent monitoring of large number of genes expressions. Each microarray experiment produces hundreds of images. Each digital image requires a large storage space. Hence, real-time processing of these images and transmission of them necessitates efficient and custom-made lossless compression schemes. In this paper, we offer a new archi...
متن کاملA Method to Expand Family of Continuous Distributions based on Truncated Distributions
Abstract: A new method to generate various family of distributions is introduced. This method introduces a new two-parameter extension of the exponential distribution to illustrate its application. Some statistical and reliability properties of the new distribution, including explicit expressions for the moments, quantiles, mode, moment generating function, mean residual lifetime, stochas...
متن کامل